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2 (57) Abstract: A robust watermark embedded (20) into a Direct Stream Digital (DSD) audio signal including a flat frequency re- 
sponse in a specific frequency range which does not extend below 20 kHz or above 100 kHz. The watermark is therefore hidden in 
the noise spectrum of the DSD signal, such that the watermark is inaudible to a listener. Since the noise spectrum contains important 
information that helps provide the DSD signals with sharp transients and an accurate impulse response, the watermark cannot be 
removed from the DSD signal without causing significant degradation to the signal's audio quality. 



BNSDOCID: <WO 03023777A1J_> 



wo 03/023777 
A robust watermark for DSD signals 



PCT/IB02/03550 



BACKGROUND OF THE INVENTION 

HELD OF THE INVENTION 

The present invention relates generally to the protection of intellectual 
property, and more specifically, to a robust digital watermark that is encoded into a Direct 
Stream Digital (DSD) signal. 

DESCRIPTION OF THE RELATED ART 

When the compact disk (CD) was first developed. Pulse Coding Modulation 
(PCM) was the best available technology for encoding a digital audio signal to be recorded 
on a disk. The standard CD format has a 16-bit resolution and sampling fi-equency of 44.1 
kHz. 

As a result of advances made m professional recording techniques, die 
resolution and sampling rate of the standard CD format is no longer adequate to preserve the 
audio quality of the recordings being made at various digital studios. 

Fig. lA illustrates the typical audio spectrum of the standard CD signal. As 
shown in this figure, the signal band of standard CD strongly diminishes after 20 kHz. 

Super Audio Compact Disk is a high-resolution audio CD format, which 
utihzes Direct Stream Digital (DSD®), instead of PCM, for recording audio signals. The 
DSD recording format directly records the 1-bit signal, output from a sigma-delta analog-to- 
digital (A/D) converter at a sampling rate of 2.8224 MHz. The sigma-delta A/D converters 
used in DSD include noise shaping filters to effectively shift the noise out of the audio band, 
i.e., shift the noise well above 20 kHz where it is inaudible. 

DSD avoids the degradation in signal quality that may result from the 
decimation process and interpolation processes used in PCM. DSD recordings have an 
improved firequency response, which is broadened to 100 kHz. In addition, the DSD 
firequency response has a dynamic range of 120 dB when noise shaping filters of fifth (or 
higher) order are included in the sigma-delta A/D converter. Fig. IB Ulustrates the audio 
^ectrum of DSD signals. 
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Although hijinan ears generally do not hear distinct tones over 20 kHz, the 
higher frequencies do contain audio information of a transient nature (i.e., the way sounds 
start and stop), which is perceived by the human brain and provides a more "natural" feeling 
to the music. By preserving the information in this high-frequency band (i.e., in the noise 
5 spectrum), DSD technology provides a more accurate impulse response and sharp transients, 
which is a significant reason for the superior audio quality of DSD recordings over the 
standard CD recordings. 

Along with audio quality, another important issue that must be considored 
with respect to digital recording technologies is protection of the recorded content from 
10 illegal copying and distribution. One mechanism for protecting digitally recorded intellectual 
property (IP) is an encoded wateimaik. 

A watermark comprises identification data embedded within digital content, 
which may specify the ownership and the intended recipient of the content, and help enforce 
restrictions on usage (e.g., copying and distribution) of the content. Watermarks can be used 
15 to identify illegal copies of digital content, detect illegal changes in the content. 

Some watermarks are used as an attempt to prevent unlawful copying and 
distribution of digital IP. For instance, certain types of CD recorders may contain a special 
copy protection mechanism, which checks for a watermark within the content to be recorded. 
If a watermark is detected that indicates that the copying is not permitted or that the content 
20 has been illegally obtained, the mechanism disables the recording operation of the device. 

Alternatively, the purpose of a watermark may be to identify flie unlawfiil 
copying and distribution of content, in order to allow the rights owners to seek legal 
reparations from the infringing parties. 

A watermark generally comprises a signal encoded into digital content that 
25 fiilfiUs certain statistical properties, such that it can be extracted from the content in its 

original form. However, the addition of such a signal usually causes some degradation to the 
sound quality of digitally recorded audio signal. It would be advantageous to provide a 
watermark, whose presence does not cause any pCTceivable degradation in the sound quaUty 
. of an audio recording. 

30 It would fiirther be advantageous to provide a watermark that is robust, i.e., 

difficult to remove or modify without severely affecting the quality of the audio signal in 
which it is embedded. Accordingly, the watermark must be placed in perceptually significant 
components (e.g., frequency bands) of the audio signal. 



BNSDOCID: <WO ^03623777A1J_> 



wo 03/023777 PCT/IB02/03550 

3 

SUMMARY OF THE INVENTION 

The present invention provides a robust watermark for audio signals recorded 
in DSD format, which does not cause a significant degradation of the audio quality. 

An exemplary embodiment of the present invention is directed to a robust 
5 watermark embedded into a DSD audio signal, such that the watermark has a flat firequency 
spectrum in a specific firequency range that is above 20 kHz. The fi-equency response of the 
watermark is substantially zero at the band of fi-equencies below 20 kHz. 

In a finther exemplary embodiment of the present invention, the specific 
firequency range of the watermark does not extend above 100 kHz. 
10 In a fiurther exemplary embodiment of the present invention, spread-spectrum 

watermarking is used to encode the robust watermark into the DSD audio signal. 

In a fiirther exemplary embodiment of the present invention, the specific 
firequency range of the robuist watemiark is chosen to be low, such that the watemiark is 
placed in a perceptually more significant firequency band and removal of the watermark will 
1 5 result in significant degradation of tbe audio quaUty. 

In a fiirther exemplary embodiment of the present invention, the specific 
firequency range of the robust watermark is chosen to be high, such that the degradation of 
audio signal caused by the presence of the watermark is low. 

Advantages of the present invention will become more apparent fi"om the 
20 detailed description provided hereafter. However, it should be understood that the detailed 
description and specific examples, while indicating preferred embodiments of the present 
invention, are given by way of illustration only, since various changes and modifications 
within the spirit and scope of the invention will become apparent to those skilled in the art 
firom tiiis detailed description. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will become more fidly understood firom the detailed 
description given below and the accompanying drawings, which are given for purposes of 
illustration only, and thus do not limit the invention. 
30 Fig. 1 A illustrates the typical audio spectrum of a standard CD signal. 

Fig. IB illustrates the audio and noise spectrum of a DSD signal. 

Fig. 2 A is a block diagram of a DSD signal recording apparatus according to 
an exemplary embodiment where an analog watermark signal is embedded into an analog 
audio signal, prior to the encoding of the audio signal into a DSD signal. 
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Fig. 2B is a block diagram of a DSD signal recording apparatus according to 
an exemplary embodiment where a digital watermark signal is embedded into a DSD 
encoded audio signal. 

Fig. 3 is a block diagram of a watermark embedder illustrating an exemplary 
5 embodiment where the SSW technique is used to embed a robust watermark into an audio 
signal. 

Fig. 4A illustrates the spectrum of a robust watermark whose frequency band 
is chosen for greater robustness. 

Fig. 4B illustrates the spectrum of a robust watermark whose frequency band 
1 0 is chosen for lower degradation of sound quaUty . 

Fig. 5 illustrates a sigma-delta modulator according to an exemplary 
embodiment in which a soft watermark is inserted the DSD signal. 



DETAILED DESCRIPTION OF THE EXEMPLARY EMBODIMENTS 
15 The present invention provides a watermark, encoded into a specific frequency 

band of a DSD audio signal, which preferably does not extend below 20 kHz or above 100 
kHz. Since the human ear does not normally perceive harmonics above 20 kHz, the digital 
watermark does not degrade the soimd quality of the recording by any significant amount. 
However, any efforts by individuals to remove this watermark by low-pass filtering the 
20 recorded signals above 20 kHz will result in significant degradation, such that the sound 
quality of the DSD recording will be no better than standard CD recordings. 

Fig. 2 A is a block diagram of a DSD signal recording apparatus 100 according 
to an exemplary embodiment where an analog watermark signal is embedded into an analog 
audio signal, prior to the encoding of the audio signal into a DSD signal. In the recording 
25 apparatus of Fig. 2A, an analog audio signal and an analog watermark signal is input to a 

watermark embedder 20. The output of the watermark embedder 20 is connected to a sigma- 
delta modulator 40. The output of the sigma-delta modulator 40 is sent to a DSD signal 
recorder 50. 

The operation of the DSD signal recording apparatus 100 of Fig. 2A is as 
30 follows. ITie watermark embedder 20 inserts the watermark signal into the audio signal in a 
specific frequency range. According to an exemplary embodiment, this range is chosen such 
that the lower frequency limit does not extend below approximately 20 kHz and that the 
higher frequency limit does not extend higher than approximately 100 kHz. According to 
alternative embodiments, however, that the lower limit of the frequency range can be set 
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below 20 kHz in order to obtain a more robust watermark at the expense of sound quality. A 
more detailed description of an exemplary watermark embedder 20 will be given below in 
coimection with Fig. 3. 

The watermarked signal output jBrom the watermark embedder 20 is sent to a 
S sigma-delta modulator 40, which quantizes the watermarked signal into a 1-bit DSD encoded 
signal. The sigma-delta modulator 40 uses a negative feedback loop to accumulate the DSD 
encoded bitstream. The accumulated signal is sent to a fifth-order noise-shaping filter. The 
output of this filter 42 is quantized into a "1" or "0" by quantizer 44. The 1-bit output of 
quantizer 44 is looped back via a one-sample delay unit 46 and a digital to analog signal 

1 0 converter 48. The bitstream output of the sigma-delta modulator is recorded by the DSD 
signal recording apparatus 50, for example, on a SACD disk. 

According to an altemative exemplary embodiment, the recording apparatus 
100 may first encode the analog audio signal into a DSD signal before inserting the 
watermark. Fig. 2B is a block diagram of a DSD signal recording apparatus according to this 

1 5 altemative exemplary embodiment. 

In Fig. 2B, the analog audio signal is inserted into a sigma-delta modulator 40. 
The DSD encoded signal output firom the sigma-delta modulator 40 is input into the 
watermark embedder 20, whose output is coxmected to DSD signal recorder SO. 
In the recording apparatus of Fig. 2B, the watermark comprises a bit pattern which is 

20 encoded into a specific frequency band of fiie DSD audio signal, where the specific 
frequency band does not substantially extend below 20 kHz or above 100 kHz. 

According to an exemplary embodiment, the watermark is inserted into the 
DSD signals through a process called Spread Spectrum Watermarking (SSW). This process is 
analogous to spread spectrum wireless commvinications, in which a narrowband signal is 

25 transmitted over a much larger bandwidth such that the energy in any single frequency is 
imdetectable. The SSW process spreads the wateraiark signal across a wide frequency band 
of the digital content, where the frequency band is perceptually significant. 

When applying a watermark to an audio signal, the SSW process spreads the 
watermark over frequency components that contribute to the overall sound quality of the 

30 signal. Since the watermark is hidden across a much wider frequency band than is needed, it 
will take the form of a relatively weak noise in the content that is virtually undetectable by 
individuals wishing to remove the watermark. Therefore, the only way that the watermark 
can be removed is by filtering out the significant frequency components of an audio signal. 
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and thereby causing noticeable degradation to the sound quality. Accordingly, the watermark 
of the SSW process can be very robust 

In spread spectrum watermarking, the watermark signal is inserted into the 
audio signal using a select set of frequency sub-bands, within the given frequency range for 
5 the watermark- Using a scheme known as frequency hopping, the frequency sub-band of the 
watermark changes abruptly, many times each second. In this scheme, the watermark signal 
may be modxilated with a generated pseudo-noise signal and thra scaled to a relatively low 
power level. 

Since the frequency sub-band of the watermark at each point in time is 

10 determined pseudo-randomly, it appears in the frequency domain as a weak noise. However, 
the watermark verification process knows all of the frequency sub-bands where the 
watermark is embedded, such that it is possible to combine these relatively weak signals into 
a single signal with a high Signal to Noise Ratio (SNR). The combined signal can then be 
verified by examining its content. 

15 Fig. 3 illustrates a watermark embedder 20, which embeds a robust watermark 

into an audio signal using the SSW process. A modulator 24 modulates flie watermark with a 
pseudo-noise signal generated by the pseudo-noise generator 22. According to an exemplary 
embodiment of the present invention, the pseudo-noise signal is generated such that it spreads 
the watermark within a specific frequency range that does not extend substantially below 20 

20 kHz or substantially above 100 kHz. The modulated watermark signal is passed through 

ampUfier 26 where it is scaled to the appropriate level. The signal is then combined with the 
input audio signal usmg addition element 28, resulting in a watermarked audio signal. 

Should the watermark embedder 20 of Fig. 3 be used in connection with the 
recording apparatus of Fig. 2A, the input audio signal and the watermark signal would then 

25 comprise analog signals. Alternatively, if the watermark embedder 20 of Fig. 3 is to be used 
in conjunction with the recording apparatus of Fig. 2B, then the input audio signal comprises 
a DSD encoded signal and the watermark comprises a digital bit pattern. 

It should be noted that other SSW methods, including sub-band decomposition 
watermarking techniques, could be used to insert the watermark in the DSD signal. In 

30 addition, the present invention is in no way limited to using the spread spectrum 

watermarking process to watermark the audio signal. Other methods for encoding a 
watermark into a specific frequency range of a DSD audio signal may also be used, as will be 
contemplated by those of ordinary skill in the art. 



BNSDOCID: <WO__03023777A1J_> 



wo 03/023777 PCT/IB02/03550 

7 

According to an exemplary embodiment of the present invention, the specific 
frequency range in which the watermark is embedded into the DSD signal is chosen to be 
rather low. For example, the jfrequency range may configured to be approximately between 
20 and 30 IdEIz. Fig. 4A illustrates the firequency spectrum of a watermark at such a firequency 
5 range. Any attempts to remove this watermark by low-pass filtering will result in a noticeable 
degradation in the audio quality of the DSD signal. This is because the low-pass filtering will 
remove all transient information from the audio signal at the frequency components above 20 
kHz. Also, low-pass filtering the audio signal at a cutoff frequency of 20 kHz may further 
result in a blocking of harmonics that can be perceived by a Ustener. 

10 As discussed earlier, if all signals above 20 kHz were removed from a DSD 

recording by low-pass filtering, the sound quality of the resultant signal would be no better 
than the standard CD recordings. Accordingly, it would not be worth the effort for to illegally 
remove the watermark and make copies of a SACD disk recording, since the copies will not 
come close to matching the quality standards of SACD. 

1 5 Although this embodiment has been described with respect to a digital 

watermark whose firequency range is between about 20 and 30 kHz, it should be noted that 
this range is merely exemplary of the general principle that digital watermarks having a lower 
given frequency range will result in much greater degradation to the DSD audio signal if they 
are removed. 

20 There is a disadvantage, however, in choosing a low frequency range for the 

watermark. Specifically, the mere presence of such a watermark in the DSD signal causes 
some degradation to the sound quality. In the discussion above with respect to SS W process, 
it was noted that the watermark takes the form of noise in the audio signal. Such noise may 
affect the audio quality, even if its frequency range is a little bit above 20 kHz. 

25 According to another exemplary embodiment of the present invention, the 

frequency range of the watermark is chosen to be at a high frequency range, such that its 
presence does not adversely affect the DSD signal. Fig. 4B illustrates an example of a digital 
watermark whose firequency range is approximately between 90 and 100 kHz. Since the 
transient information at the frequency components closer to 100 kHz is not as easily 

30 perceived by a listener, the degradation caused by a watermark inserted in this frequency 
range is not at all significant. 

Accordingly, the example of Fig. 4B is illustrative of the general principle of 
this embodiment, namely, that a watermark inserted in a higher frequency range of a DSD 
signal causes less degradation to the sound quality. However, using a high frequency 
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watennark does have the disadvantage that it can be removed by low-pass filtering the DSD 
signal with a higher cutoff firequency, resulting in less degradation to the signal. 
According to a further embodiment of the present invention, a robust watermark, as described 
with respect to the embodiments above, may be embedded in a DSD audio signal, which also 
5 contains a soft watennark. 

The concept of soft watermarking is described in U.S. Patent No. 6,157,330, 
the subject matter of which is herein incorporated by reference. A soft watermark can be 
inserted into the DSD signal by replacing selected bits of the output bitstream with bits of a 
watermark pattern. 

10 For instance, the sigma-delta modulator 40 illustrated in Fig. 5 may be used to 

insert a soft watermark into the DSD signal. This sigma-delta modulator 40 includes a bit 
modification circuit 45 for receiving the bits output firom the quantizer 44, The function of 
the quantizer 44 is to quantize the output of the fifth-order noise shaping filter into "l"s and 
"0"s. The bit modification circuit 45 may be configured to replace selected bits output firom 

15 flie quantizer 44, e.g., every 1 OO*'' bit, with bits corresponding to a bit pattern of the soft 
watermark. 

Alternatively, the soft watennark data may be represented by the number of bit 
periods between successive inverted bits. Thus, the bit modification circuit 45 may be 
configured to invert selected bits in the quantized bitstream. 

20 The insertion of a soft watermark results in a mere 1 dB increase of 

quantization noise, which is not audible in the 120 dB dynamic range of the DSD signal. 
While soft watermarks do not degrade the audio quality by any significant amount, they are 
easily removed fi'om a DSD signal by ahnost any type of signal processing. Therefore, soft 
watermarks are not considered robust However, such watermarks may still be useful, 

25 because the absence of a soft watennark in a DSD recording indicates that the audio signal is 
a copy. 

Accordingly, a soft watermark may be used in combination with a robust 
watermark, which is embedded in the DSD signal. A verification process can be used to 
determine whether the soft watermark is missing from the DSD content, i.e., whether the 
30 content has been copied. If so, the robust watermark can be used to identify the owner or 
rights holder of the content, so that legal restitution can be made. 

The above embodiments have been described with respect to a digital 
watermark embedded in a frequency range of a DSD signal that does not extend substantially 
below 20 kHz or substantially above 100 kHz, However, a watermark may be inserted in a 
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frequency range that extends below 20 kHz or above 100 kHz and not depart from the spirit 
and scope of the invention. For example, the watermark may be inserted in frequency 
components of the audio signal well below 20 kHz, at such short and infrequent intervals that 
it is not audible to a listener. 
5 The present invention has been described with reference to the exemplary 

embodiments. As will be evident to fliose of ordinary skill in the art, various modifications of 
this invention can be made ia light of the foregoing disclosure without departing from the 
spirit and scope of the claims. 
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1 . A method comprising: 

inserting a watermark into a Direct Stream Digital (DSD) signal to create a 
watermarked signal, said inserted watermark being spread across a frequency range of said 
watermarked signal. 

5 

2. The method of claim 1, wherein said frequency range does not extend 
substantially below 20 kHz, 

3 . The method of claim 1 , wherein said frequency range does not extend 
1 0 substantially above 1 00 kHz. 

4. The method of claim 1 , wherein said frequency range does not extend 
substantially below 20 kHz and does not extend substantially above 100 kHz. 

15 5. The method of claim 1, wherein said inserting step is perfomied using a spread 

spectrum watermarking process. 

6. The method of claim 1 , wherein prior to said inserting step, a soft watermark 
is inserted into said DSD signal by replacing selected bits of said DSD signal with bits 

20 corresponding to soft watermark data. 

7. A method comprising: 

inserting a watermark into an analog signal to create a watermarked signal, 
said inserted watermark being spread across a frequency range of said watermarked signal; 
25 converting said watermarked signal into a Direct Stream Digital (DSD) signal, 

8. A Direct Stream Digital (DSD) signal including an embedded watermark, 
which is spread across a frequency range of said DSD signal. 
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9. A data storage medium having stored thereon a DSD signal as claimed in 
claims. 

10. A recording apparatus (1 00) comprising: 

5 a modulator (40) for converting an input signal into a Direct Stream Digital 



(DSD) signal; 

a watermark embedder (20) for embedding a watermark into said DSD signal 
and outputting a watermarked signal, such that said watermark is spread across a frequency 
range of said watermarked signal; and 
10 a recorder (50) for recording said watermarked signal on a storage medium. 

1 1 . The recording apparatus (1 00) of claim 1 0, wherein said wateraiark embedder 
(20) utilizes a spread spectrum watermarking (SS W) technique, said watermark embedder 
(20) further comprising 

IS a pseudo noise generator (22) for gmerating pseudo noise; 

a modulator (24) for modulating said watermark with said pseudo noise to 
generate a modulated watermark signal; 

an amplifier (26) for scaling the modulated watermark signal; and 
an addition element (28) for combining the DSD signal with the scaled modulated watermark 
20 signal. 

12. A recording apparatus (100) comprising: 

a watermark embedder (20) for embedding a watermark into an analog signal 
and oulputting a watermarked signal, such that said watermark is spread across a frequency 
25 range of said watermarked signal; 

a modulator (40) for converting said watermarked signal into a Direct Stream 
Digital (DSD) signal; and 

a recorder (50) for recording said DSD signal on a storage medium. 
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